Using sample size to limit exposure to data mining

Similar resources

Using Sample Size to Limit Exposure to Data Mining

Data mining introduces new problems in database security. The basic problem of using non-sensitive data to infer sensitive data is made more difficult by the “probabilistic” inferences possible with data mining. This paper shows how lower bounds from pattern recognition theory can be used to determine sample sizes where data mining tools cannot obtain reliable results.

Bayesian sample size Determination Using a Scaled Exponential Utility Function According to Numerical Method

In this paper we propose a utility function and obtain the Bayes estimate and the optimum sample size under this utility function. This utility function is designed especially to obtain the Bayes estimate when the posterior follows a gamma distribution. We consider a Normal with known mean, a Pareto, an Exponential and a Poisson distribution for an optimum sample size under the propose...

Proposing a method to classify texts using data mining

Today a significant part of available data is saved in text databases or text documents. The most important thing is to organize these documents. One way to organize text documents is to classify them. To classify texts is to assign text documents to their actual categories. This has two main steps, i.e. feature selection and learning algorithm selection. There have been several methods suggested to clas...

Determining the sample size required to compare vegetation and soil characteristics in two independent groups using effect size

Extended Abstract. Background and objectives: One of the important steps in assessing rangeland vegetation is determining the sample size. The adequacy of the sample size and its determination are always among the main concerns of rangeland vegetation analysts. There are two general methods for determining the sample size in rangeland science: graphic and statistical methods. In this study, the sample...

A Proposed Model to Identify Factors Affecting Asthma using Data Mining

Introduction: The identification of asthma risk factors plays an important role in preventing asthma and in reducing the severity of its symptoms. Nowadays, the identification process can be performed using modern techniques. Data mining is one such technique, with many applications in the fields of diagnosis, prediction, and treatment. This study aimed to identify the effecti...

Journal

Journal title: Journal of Computer Security

Year: 2000

ISSN: 1875-8924, 0926-227X

DOI: 10.3233/jcs-2000-8403